Register-Pressure-Aware Instruction Scheduling Using Ant Colony Optimization

نویسندگان

چکیده

This paper describes a new approach to register-pressure-aware instruction scheduling, using Ant Colony Optimization (ACO) . ACO is nature-inspired optimization technique that researchers have successfully applied NP-hard sequencing problems like the Traveling Salesman Problem (TSP) and its derivatives. In this work, we describe an algorithm for solving long-standing compiler problem of balancing Instruction-Level Parallelism (ILP) Register Pressure (RP) in pre-allocation scheduling. Three different cost functions are studied estimating RP during The proposed implemented LLVM open-source compiler, performance evaluated experimentally on three machines with instruction-set architectures: Intel x86, ARM, AMD GPU. compared exact Branch-and-Bound (B&B) previous work. On x86 both algorithms relative LLVM's generic scheduler, while GPU, AMD's production scheduler. experimental results show SPECrate 2017 Floating Point, gives geometric-mean improvements 1.13% 1.25% execution speed respectively, Using PlaidML it improvement 7.14% approximately same execution-time as B&B algorithm, each outperforming other substantial number hard scheduling regions. better than many large instances times out on. Both outperform CPU

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Instruction Scheduling Using MAX −MIN Ant Colony Optimization

Instruction scheduling is a fundamental step for a mapping an application to a computational device. It takes a behavioral application specification and produces a schedule for the instructions onto a collection of system processing units. The objective is to minimize the completion time of the given application while effectively utilizing the computational resources. The instruction scheduling...

متن کامل

Process Scheduling Using Ant Colony Optimization Techniques

The growing availability of low cost microprocessors and the evolution of computing networks have enabled the construction of sophisticated distributed systems. The computing capacity of these systems motivated the adoption of clusters to build high performance solutions. The improvement of the process scheduling over clusters originated several proposals of scheduling and load balancing algori...

متن کامل

Tasks Scheduling using Ant Colony Optimization

Problem statement: Efficient scheduling of the tasks to heterogeneous processors for any application is critical in order to achieve high performance. Finding a feasible schedule for a given task set to a set of heterogeneous processors without exceeding the capacity of the processors, in general, is NP-Hard. Even if there are many conventional approaches available, people have been looking at ...

متن کامل

Using Ant Colony Optimization for Infrastructure Maintenance Scheduling

For the optimal planning of maintenance schedules for infrastructural buildings (bridges, tunnels, etc) in urban road systems not only the budget has to be considered but also the impact on traffic to avoid unnecessary traffic jams. In a current research project we develop an optimization tool for this multiobjective problem based on ant colony optimization. In each iteration, the ants produce ...

متن کامل

Multi-Agent Based Patient Scheduling Using Ant Colony Optimization

With the advent of Agent Technology, and challenges facing the health care sector as a result of scarcity of both medical personnel and resources and also the distributive and complex nature of the activities in the medical domain, there is need to use the new programming paradigm in order to come up with robust and reliable system that will solve problems bedeviling the health care sector espe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: ACM Transactions on Architecture and Code Optimization

سال: 2022

ISSN: ['1544-3973', '1544-3566']

DOI: https://doi.org/10.1145/3505558